Application No. 1 0/809, 1 62 

Reply to Office Action of June 12, 2007. 

IN THE CLAIMS 
Please amend the claims as follows: 

1 . (Currently Amended): M e thod A method for pre-processing speech, in particular 
in a m e thod for r e cognizing sp ee ch, comprising the steps of: 
[[-]] receiving a speech signal; [[(S),]] 

[[-]] separating [[a]] an entire spectrum [[(F)]] of said speech signal [[(S)]] into a 
[[given]] number [[(N)]] of predetermined frequency sub-bands; (fi. , — 9 F^ h 

[[-]] analyzing said speech signal [[(S)]] within each of said frequency sub-bands; 
(Fi , F uh 

[[-]] th e r e by generating respective band-dependent acoustic feature data (©+r^7 
Q N ) for each of said respective frequency sub-bands (fi- , F N ), [[which]] the band- 
dependent acoustic feature data (Q+ , 0 N )-are being at least in part r e pres e nta tiv e for 
representative of said speech signal [[(S)]] with respect to a respective frequency sub-band; 
(F± , F N )j 

[[-]] deriving band-dependent likelihoods (b ^ b N ) for occurrences of speech 
elements (P ±, P ffl ) or of sequences thereof within said speech signal [[(S)]] based on said 
band-dependent acoustic feature data; (Q± , O n ) and/or a d e rivativ e th e r e of, 

[[-]] analyzing said speech signal [[(S)]] within said entire spectrum; [[ (F),]] 

[[-]] th e reby generating full-band acoustic feature data A (FBE F; FFBE; FBE F 
SSUB; Of Lss^a ), which are the full-band acoustic feature data being at least in part 
representative [[for]] of said speech signal [[(S)]] with respect to said entire spectrum; [[(F),]] 

[[-]] deriving a full-band likelihood (Bj^-Bssys) for occurrences of speech 
elements P ffl ) or of sequences thereof within said speech signal [[(S)]] based on said 

full-band acoustic feature dat a; and (FBE F; FFBE; FBE F SSUB; O elssub ) and/or a 
d e rivative thereof, 
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[[-]] deriving an overall likelihood [[(B)]] for occurrences of speech elements (P± ? 

7777-P m ) or of sequences thereof within said speech signal [[(S)]] based on said band-dependent 

likelihoods (b ^ and said full-band likelihood (Bff^-Bssw). 

2. (Currently Amended): The method according to claim 1, wherein characterized in 
that wh e n deriving said overall likelihood [[(B)]] includes combining said band-dependent 
likelihoods (b ±, b N ) ar e combin e d to a union model likelihood (Bu,mfcc) by determining 
[[the]] a number of uncorrupted frequency sub-bands of said frequency sub-bands (F ±, F N ), 
and adding all possible combinations of products of different the band-dependent likelihoods 
(bi ., b u) corresponding to the respective frequency sub-bands. 

3. (Currently Amended): The method according to claim 1 , charact e riz e d in that 
wherein the step of generating the band-dependent acoustic feature data comprises generating 
said band-dependent acoustic feature data (Qj. , On) compris e that include respective band- 
dependent mel-frequency cepstral coefficient features, which are based on mel-frequency 
cepstral coefficients and/or a d e rivative thereof derived from the respective frequency sub- 
bands (fi. , F m). 

4. (Currently Amended): The method according to claim 1, characterized in that 
further comprising: 

applying a predetermined broadband noise robustness technique is appli e d prior to 
deriving said full-band likelihood term-^ppi-Bssug). 

5. (Currently Amended): The method according to claim 4, charact e riz e d in that 
wherein the step of applying the predetermined broadband noise robustness technique 
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comprises applying said broadband noise robustness technique [[is]] based on a frequency- 
filtering technique. 

6. (Currently Amended): The method according to claim 4, charact e riz e d in that 
wherein the step of applying the predetermined broadband noise robustness technique 
comprises applying said broadband noise robustness technique [[is]] based on a method of 
sp e c tral spectral -subtraction. 

7. (Currently Amended): The method according to claim 1, charact e riz e d in that 
wherein the step of generating the full-band acoustic feature data comprises generating said 
full-band acoustic feature data (FBE F; FFBE; FBE F SSUB; O f ,ssub) comprise that include 
filter bank energy features (FBE F) , which are based on filter bank energies derived from 
said entire spectrum [[(F)]]. 

8. (Currently Amended): The method according to claim 1, charact e riz e d in that 
wherein the step of generating the full-band acoustic feature data comprises generating said 
full-band acoustic feature data (FBE F; FFBE; FBE F SSUB; O f ,ssub) compris e that include 
filtered filter bank energy features (FFBE) , which are based on fil t e r e d filtered filter bank 
energies derived from said entire spectrum [[(F)]]. 

9. (Currently Amended): The method according to claim 1, characterized in that 
wherein the step of generating said full-band acoustic feature data comprises generating said 
full-band acoustic feature data fFBE F: FFBE: FBE F SSUB: Op ssttb) compris e that include 
full-band mel-frequency cepstral coefficient features, which are based on mel-frequency 
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cepstral coefficients and/or a d e rivativ e th e r e of d e riv e d derived from said entire spectrum 
[[(F)]]. 

10. (Currently Amended): The method according to claim 1, charact e rized in that 
wherein the step of generating said full-band acoustic feature data and/or said band- 
dependent acoustic feature data comprises generating said full-band acoustic feature data 
(FBE F; FFBE; FBE F SSUB; O f,ssub) and/or said band-dependent acoustic feature data (©4-7 
— 9 On) compris e that include PLP-linear prediction filter features, which are based on PLP- 
linear prediction fil - t e r filter coefficients. 

1 1 . (Currently Amended): The method according to claim 1 , charact e riz e d in that 
wherein the step of generating the full-band acoustic feature data comprises generating said 
full-band acoustic feature data (FBE; FFBE; FBE F SSUB; O f , s s ub ) comprise that include 
spectrally-changed full-band mel-frequency cepstral coefficient features f e a tur e s (Of Lggw), 
which are generated by applying a method of spectral sub traction subtraction to said full- 
band mel-frequency cepstral coefficient features [[(Of)]]. 

12. (Currently Amended): The method according to claim 1, characterized in that 
further comprising: 

determining, using a probability estimator, said band-dependent likelihoods (b+rr^ 
b N ) and said full-band likelihood tern* (Bpp^sswi-B^ ^) are determined using a probability 
e stimator . 

13. (Currently Amended): The method according to claim 1, charact e riz e d in that 
further comprising: 
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deriving said filtered filter bank energies (FFBE) ar e d e riv e d from said filter bank 

energies [[(FBE)]] by subtracting (f(i)-f(i+l) — i£-4)} a first filter bank energy [[(FBE M )]] 

from a second filter bank energy [[(FBE;+i)]], wherein said first filter bank energy [[(FBEj_ 

i)]] corresponds to a first discrete frequency and said second filter bank energy [[(FBEj+i)]] 

corresponds to a second discrete frequency, lying two discrete frequency steps after said first 

filter bank energy [[(FBE M )]]. 

14. (Currently Amended): Sp ee ch A speech pre-processing system, in particular 
int e grat e d into a spe e ch proc e ssing syst e m, which is capabl e of p e rforming or r e alizing a 
m e thod for pr e proc e ssing sp ee ch according to claim 1 and/or th e st e ps th e reof comprising . 

means for receiving a speech signal; 

means for separating an entire spectrum of said speech signal into a number of 
predetermined frequency sub-bands; 

means for analyzing said speech signal within each of said frequency sub-bands; 

means for generating respective band-dependent acoustic feature data for each of said 
respective frequency sub-bands, the band-dependent acoustic feature data being at least in 
part representative of said speech signal with respect to a respective frequency sub-band; 

means for deriving band-dependent likelihoods for occurrences of speech elements or 
of sequences thereof within said speech signal based on said band-dependent acoustic feature 
data; 

means for analyzing said speech signal within said entire spectrum; 

means for generating full-band acoustic feature data, the full-band acoustic feature 
data being at least in part representative of said speech signal with respect to said entire 
spectrum; 
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means for deriving a full-band likelihood for occurrences of speech elements or of 
sequences thereof within said speech signal based on said full-band acoustic feature data; and 

means for deriving an overall likelihood for occurrences of speech elements or of 
sequences thereof within said speech signal based on said band-dependent likelihoods and 
said full-band likelihood . 

15. (Cancelled). 

16. (Currently Amended): Comput e r A computer readable storage medium, having 
embedded therein computer executable instructions, wherein the instructions, when executed 
by a processor, cause the processor to perform a method comprising: 

comprising a comput e r program product according to claim 15 
receiving a speech signal; 

separating an entire spectrum of said speech signal into a number of predetermined 
frequency sub-bands; 

analyzing said speech signal within each of said frequency sub-bands; 

generating respective band-dependent acoustic feature data for each of said respective 
frequency sub-bands, the band-dependent acoustic feature data being at least in part 
representative of said speech signal with respect to a respective frequency sub-band; 

deriving band-dependent likelihoods for occurrences of speech elements or of 
sequences thereof within said speech signal based on said band-dependent acoustic feature 
data; 

analyzing said speech signal within said entire spectrum; 

generating full-band acoustic feature data, the full-band acoustic feature data being at 
least in part representative of said speech signal with respect to said entire spectrum; 
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deriving a full-band likelihood for occurrences of speech elements or of sequences 
thereof within said speech signal based on said full-band acoustic feature data; and 

deriving an overall likelihood for occurrences of speech elements or of sequences 
thereof within said speech signal based on said band-dependent likelihoods and said full-band 
likelihood. 
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